Journal of Zhejiang University


  




41 results found.
index Title
16 Pre-training with asynchronous supervised learning for reinforcement learning based autonomous driving
Author(s):Yunpeng Wang, Kunxian Zheng, Daxin Tian, Xuting Duan, Jianshan Zhou  Clicked:5440  Download:3582  Cited:0  <Full Text>  <PPT> 1300
Frontiers of Information Technology & Electronic Engineering  2021 Vol.22 No.5 P.673-686  DOI:10.1631/FITEE.1900637
17 Minimax Q-learning design for H∞ control of linear discrete-time systems
Author(s):Xinxing LI, Lele XI, Wenzhong ZHA, Zhihong PENG  Clicked:6531  Download:4324  Cited:0  <Full Text>  <PPT> 375
Frontiers of Information Technology & Electronic Engineering  2022 Vol.23 No.3 P.438-451  DOI:10.1631/FITEE.2000446
18 Decentralized multi-agent reinforcement learning with networked agents: recent advances
Author(s):Kaiqing Zhang, Zhuoran Yang, Tamer Başar  Clicked:5314  Download:4582  Cited:0  <Full Text>
Frontiers of Information Technology & Electronic Engineering  2021 Vol.22 No.6 P.802-814  DOI:10.1631/FITEE.1900661
19 Multi-agent deep reinforcement learning for end–edge orchestrated resource allocation in industrial wireless networks
Author(s):Xiaoyu LIU, Chi XU, Haibin YU, Peng ZENG  Clicked:4045  Download:5124  Cited:0  <Full Text>  <PPT> 744
Frontiers of Information Technology & Electronic Engineering  2022 Vol.23 No.1 P.47-60  DOI:10.1631/FITEE.2100331
20 Towards autonomous and optimal excavation of shield machine: a deep reinforcement learning-based approach
Author(s):Ya-kun ZHANG, Guo-fang GONG, Hua-yong YANG, Yu-xi CHEN, Geng-lin CHEN  Clicked:1863  Download:1411  Cited:0  <Full Text>  <PPT> 306
Journal of Zhejiang University Science A  2022 Vol.23 No.6 P.458-478  DOI:10.1631/jzus.A2100325
21 Soft-HGRNs: soft hierarchical graph recurrent networks for multi-agent partially observable environments
Author(s):Yixiang REN, Zhenhui YE, Yining CHEN, Xiaohong JIANG, Guanghua SONG  Clicked:1690  Download:2290  Cited:0  <Full Text>  <PPT> 238
Frontiers of Information Technology & Electronic Engineering  2023 Vol.24 No.1 P.117-130  DOI:10.1631/FITEE.2200073
22 Optimal synchronization control for multi-agent systems with input saturation: a nonzero-sum game
Author(s):Hongyang LI, Qinglai WEI  Clicked:2040  Download:3327  Cited:0  <Full Text>  <PPT> 298
Frontiers of Information Technology & Electronic Engineering  2022 Vol.23 No.7 P.1010-1019  DOI:10.1631/FITEE.2200010
23 Coach-assisted multi-agent reinforcement learning framework for unexpected crashed agents
Author(s):Jian ZHAO, Youpeng ZHAO, Weixun WANG, Mingyu YANG, Xunhan HU, Wengang ZHOU, Jianye HAO, Houqiang LI  Clicked:1754  Download:3823  Cited:0  <Full Text>  <PPT> 290
Frontiers of Information Technology & Electronic Engineering  2022 Vol.23 No.7 P.1032-1042  DOI:10.1631/FITEE.2100594
24 Multi-agent differential game based cooperative synchronization control using a data-driven method
Author(s):Yu SHI, Yongzhao HUA, Jianglong YU, Xiwang DONG, Zhang REN  Clicked:1949  Download:3547  Cited:0  <Full Text>  <PPT> 306
Frontiers of Information Technology & Electronic Engineering  2022 Vol.23 No.7 P.1043-1056  DOI:10.1631/FITEE.2200001
25 Stochastic pedestrian avoidance for autonomous vehicles using hybrid reinforcement learning
Author(s):Huiqian LI, Jin HUANG, Zhong CAO, Diange YANG, Zhihua ZHONG  Clicked:1510  Download:1826  Cited:0  <Full Text>  <PPT> 260
Frontiers of Information Technology & Electronic Engineering  2023 Vol.24 No.1 P.131-140  DOI:10.1631/FITEE.2200128
26 Image-based traffic signal control via world models
Author(s):Xingyuan DAI, Chen ZHAO, Xiao WANG, Yisheng LV, Yilun LIN, Fei-Yue WANG  Clicked:1150  Download:1963  Cited:0  <Full Text>
Frontiers of Information Technology & Electronic Engineering  2022 Vol.23 No.12 P.1795-1813  DOI:10.1631/FITEE.2200323
27 Interactive medical image segmentation with self-adaptive confidence calibration
Author(s):Chuyun SHEN, Wenhao LI, Qisen XU, Bin HU, Bo JIN, Haibin CAI, Fengping ZHU, Yuxin LI, Xiangfeng WANG  Clicked:689  Download:446  Cited:0  <Full Text>  <PPT> 121
Frontiers of Information Technology & Electronic Engineering  2023 Vol.24 No.9 P.1332-1348  DOI:10.1631/FITEE.2200299
28 A home energy management approach using decoupling value and policy in reinforcement learning
Author(s):Luolin XIONG, Yang TANG, Chensheng LIU, Shuai MAO, Ke MENG, Zhaoyang DONG, Feng QIAN  Clicked:705  Download:463  Cited:0  <Full Text>  <PPT> 169
Frontiers of Information Technology & Electronic Engineering  2023 Vol.24 No.9 P.1261-1272  DOI:10.1631/FITEE.2200667
29 A learning-based control pipeline for generic motor skills for quadruped robots
Author(s):Yecheng SHAO, Yongbin JIN, Zhilong HUANG, Hongtao WANG, Wei YANG  Clicked:617  Download:448  Cited:0  <Full Text>
Journal of Zhejiang University Science A  In Press    DOI:
30 A multipath routing algorithm for satellite networks based on service demand and traffic awareness
Author(s):Ziyang XING, Hui QI, Xiaoqiang DI, Jinyao LIU, Rui XU, Jing CHEN, Ligang CONG  Clicked:697  Download:2587  Cited:0  <Full Text>  <PPT> 213
Frontiers of Information Technology & Electronic Engineering  2023 Vol.24 No.6 P.844-858  DOI:10.1631/FITEE.2200507
Journal of Zhejiang University-SCIENCE, 38 Zheda Road, Hangzhou 310027, China
Tel: +86-571-87952783; E-mail: cjzhang@zju.edu.cn
Copyright © 2000 - 2024 Journal of Zhejiang University-SCIENCE